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Abstract 

We consider random systems of linear equations over GF(2) in which every equation binds k variables. 
We obtain a precise description of the clustering of solutions in such systems. In particular, we prove that 
with probability that tends to 1 as the number of variables, n, grows: for every pair of solutions a, r, either 
there exists a sequence of solutions ct, . . . ,r in which successive elements differ by O(logn) variables, or 
every sequence of solutions a, . . . ,t contains a step requiring the simultaneous change of i}{n) variables. 
Furthermore, we determine precisely which pairs of solutions are in each category. Our results are tight 
fvj ' and highly quantitative in nature. Moreover, our proof highlights the role of unique extendability as the 

driving force behind the success of Low Density Parity Check codes and our techniques also apply to the 
problem of so-called pseudo-codewords in such codes. 
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1 Introduction 

Random Constraint Satisfaction Problems (CSPs) have emerged as a mathematically tractable vehicle for 
studying the performance of algorithms and proof systems. In the most well-studied setting, one has a set 
of n variables all with the same (small) domain D and a set of to = 0(n) constraints, each of which binds 
a randomly selected subset of fc = 0(1) variables. Two canonical examples are random fc-SAT and coloring 
sparse random graphs. Such random CSPs have proven difficult for proof systems, e.g., in the seminal work 
of Chvatal-Szemeredi on resolution [7], and, more recently, for some of the most sophisticated algorithms 
known [3 [6]. 

A fundamental quantity in the study of random CSPs is the so-called constraint density, i.e., the ratio of 
constraints-to- variables a = m/n. In particular, for many NP-complete CSPs there is a critical density above 
which solutions provably exist, yet no known polynomial-time algorithm can find one. In [2 it was shown 
that for random /c-SAT and for random graph coloring this algorithmic breakdown coincides with the onset 
of solution clustering: the set of solutions can be partitioned into exponentially many sets (clusters) that 
have linear Hamming distance from one another. Moreover, in each cluster a large fraction of the variables 
are frozen, i.e., take the same value in all solutions in the cluster. There is on-going work, see e.g., [H], to 
establish that clustering is a universal phenomenon for random CSPs. 

Until now there has not been a precise description of the clusters for any random CSP. The main 
contribution of this paper is such a description of the clusters of fc-XOR-SAT. This has long been recognized as 
one of the most accessible of the fundamental random CSP models, in that researchers have managed to prove 
difficult results for /c-XOR-SAT that appear to be far beyond our reach for, e.g., random fc-SAT and random 
graph coloring. Perhaps the most notable result along these lines is Dubois and Handler's [21 determination of 
the satisfiability threshold for random fc-XOR-SAT. Before we present our results we present some background 
on sparse systems of random linear equations over GF(2) that puts our results in perspective. 

1.1 Random systems of linear equations 

We consider systems oi m = 0(n) linear equations over n Boolean variables, where each equation binds a 
constant number of variables fc > 3. (The case fc < 2 is trivial.) Clearly, deciding whether such a system 
has satisfying assignments (solutions) can be done in polynomial time by, say, Gaussian elimination. In fact, 
the set of solutions forms a subspacc. so that the sum of two solutions is also a solution. At the same time, 
it seems that if one fails to exploit the underlying algebraic structure everything falls apart. For example, 
if the system is unsatisfiable, finding a value assignment a that satisfies as many equations as possible, i.e., 
MAX XOR-SAT, is NP-complete. Moreover, given a satisfiable system and an arbitrary a G {0, 1}", finding 
a solution nearest to a is also NP-complete [3]. Finally, random fc-XOR-SAT (defined below) appears to 
be extremely difficult both for generic CSP solvers and for SAT solvers working on a SAT encoding of the 
instance. Indeed, very recent work strongly suggests that among a wide array of random CSPs, random 
fc-XOR-SAT is the most difficult for random walk type algorithms such as WalkSat [11] . 

Random fc-XOR-SAT, which we study here, is the case where each equation binds exactly fc variables. To 
form the random system of equations Ax = b we take A to be the adjacency matrix of a random fc- uniform 
hypergraph H with n variables and m edges and b € {0, 1}™ to be a uniformly random vector. To choose 
A we can either select exactly to out of the possible (T) edges uniformly and independently, or include 
each possible edge independently with probability p. (Results transfer readily betwen the two models when 
m — p(^').) In this paper, wc work with the latter model, which we denote A'fc(n,p). We will say that a 
sequence of events £"„ holds with high probability (w.h.p.) for such a system if lim„_5.oo Pr[fn] = 1. 

We note that as n — )• oo. the degrees of the variables in such a random system tend to Poisson random 
variables with mean km/n. This means that for any finite a = m/n, w.h.p. there will be 17 (n) variables of 
degree and 1. Clearly, variables of degree do not affect the satisfiability of the system. Similarly, if a 
variable v appears in exactly one equation e^, then, we can always satisfy e^ by setting v appropriately for 
any constant bi. Therefore, we can safely remove e^ from consideration and only revisit it after we have found 
a solution to the remaining equations. Crucially, this removal of e^ can cause the degree of other variables 
to drop to 1. This leads us to the definition of the core of a hypergraph. 



Definition 1. The r-core of a hypergraph H is the maximum subgraph of H in which every vertex has degree 
at least r. 

Trivially, removing any vertex of degree less than r from H does not change its r-core. Therefore, the 
r-core is the (potentially empty) outcome of the procedure: repeatedly remove an arbitrary vertex of degree 
less than r until no such vertices remain. 

As discussed above, any satisfying assignment to the 2-core variables can be readily extended to the 
remaining variables. We will see in Section [5] below that there is a natural heuristic argument which would 
lead one to guess that different satisfying assignments on the 2-core variables have Hamming distance J7(n). 
Indeed, in [161 rigorous but erroneous arguments were given for this. This guess motivates the following 
very simple definition of clusters: each cluster consists of all possible extensions to a given 2-core satisfying 
assignment. However, this guess is false, as we now describe: 

Definition 2. A flippable cycle in the 2-core of a hypergraph H is a set of vertices S = {wi, ...,«(} in the 
2-core with the following property: There is a set of edges ei, ..., et in the 2-core such that each vertex Vi lies 
in Ci and in e^+i and in no other edges of the 2-core, for all 1 < i < t (addition mod t). 

Thus, the vertices vi, . . . ,vt have degree two in the 2-core, whereas the remaining vertices in edges ei , . . . , Ct 
can have arbitrary degree (at least 2) in the 2-core. Note that these remaining vertices are not part of the 
flippable cycle. 

If a is any 2-core satisfying assignment, then flipping the value of all variables in a flippable cycle 
readily yields another satisfying assignment of the 2-core. It is not hard to show that the 2-core of a 
random hypergraph often contains short flippable cycles, implying that 2-core satisfying assignments may 
have Hamming distance 0(1), a far cry from the heuristic argument and the definition of clusters described 
above. 

We note that the number of vertices in flippable cycles in the 2-core has mean 6(1) (see Lemma [23]). 

In |16| it was also argued that for every pair a, t of extensions of the same 2-core assignment, there exists 
a sequence of satisfying assignments a,a' , . . . ,t such that successive assigments have Hamming distance at 
most d = 0{\). The argument in [16], though, has a grave flaw and in fact 0(logn/loglogn) is a lower 
bound on d (see Observation [4] below). This leaves open the question of how different 2-core extensions 
relate to one another (indeed, this was stated as an open problem in [TT].) That is, is it possible to convert 
one into another by small (in Hamming distance) steps? Also, what about inside the 2-core? Is it possible 
to travel between different 2-core assignments by flipping a few variables at a time? In this work we give an 
exact answer to both of these questions, fully resolving the cluster structure of random fc-XOR-SAT. 

Definition 3. Say that two solutions a, t: 

• Are cycle-equivalent if on the 2-core they differ only on variables in flippable cycles in the 2-core (and 
arbitrarily on variables not in the 2-core). 

• Are d-conncctcd if there exists a sequence of solutions cr, ct', . . . , r such that the Hamming distance of 
every two successive elements in the sequence is at most d. 

• Are d-disconnected if they are not d- connected. 

Let the cluster of a solution a consist of all its cycle- equivalent solutions. 

We first prove that in a sparse random system of linear equations, flippable cycles are w.h.p. the only 
source of connectivity between different 2-core assignments. That is, if two 2-core satisfying assignments 
differ on even one variable not lying in a flippable cycle, they must differ in n{n) variables. 

Theorem 1. For any constants c > and k > 3, there exists a constant a = a{c,k) > such that in 
Xk{n,p = c/n), w.h.p. every pair of solutions in different clusters is an- disconnected. 



In [TBj, Mezard et al. did prove that there exists a constant 7 > such that for any 9 E (0,7), no two 
solutions differ on On variables in the 2-core. However, they did not perform the crucial analysis for solutions 
that differ on o{n) variables. This explains why they missed the fact that some solutions differ on the o{n) 
vertices of a flippable cycle. The same erroneous statement appears in [17j : again the o(n) analysis is missing. 
Providing that analysis, and using it to show that 2-corc solutions which differ on o(n) variables must differ 
only on flippable cycles, is the most difficult part of our proof of Theorem [TJ 

In stark contrast, we prove that internally clusters are very well-connected. 

Theorem 2. For any constants c > and k > 3, there exists a constant Q = Q{c, fc) > such that in 
Xk{n,p = cjn), w.h.p. every pair of solutions in the same cluster are Qlogn- connected. 

Our proof of Theorem [5] is algorithmic, giving an efficient method to travel between any pair of solutions 
in the same cluster. Theorem [2] is nearly tight due to the following. 

Observation 4. W.h.p. every cluster contains g{n)- disconnected solutions, where g{n) = f2(log7i/ loglogTi). 

Proof. Consider any solution a to the 2-corc, and consider any two extensions cro,ai of a to the entire 
formula such that, for a specific non-core variable v, we have cro{v) — 0,cri(u) = I. Then (To,cri must differ 
in at least one additional variable in every equation containing v. Thus their Hamming distance is at least 
deg(ij) + I. The observation now follows readily since w.h.p. there are variables in the tree components of 
degree at least g{n) and every variable in a tree-component can take both values in extensions of any a. D 

Finally, it is worth pointing out that for a large range of densities, the 2-core of a random fc-uniform 
hypergraph is empty. For example, for fc = 3 the 2-corc is empty [inilll] for a < 0.818..., while unsatisfiability 
occurs [5] at a = 0.917... Since in the absence of a 2-core there is only one cluster, our Theorem [2] implies 
that in this regime the entire set of solutions is wcU-conncctcd. To prove Theorem [21 we draw heavily from 
the linear structure of the constraints, (i) showing that it is possible to identify a set of non-core variables 
to act as a basis for expressing all possible extensions to each 2-core solution, and with a lot of work (ii) 
showing that changing any basis variable can only affect the value of 0(log7i) other variables. 

So, in a nutshell, we prove that before the 2-core emerges any solution can be transformed to any other 
solution along a sequence of successive solutions differing in O(logn) variables. In contrast, as soon as 
the 2-core emerges, the set of solutions shatters into clusters defined by complete agreement on the 2-core 
variables not in flippable cycles: any two solutions that disagree on even one 2-core variable not in a flippable 
cycle, differ on J7(n) variables. At the same time, solutions in the same cluster behave like solutions in the 
pre-core regime, i.e., one can travel arbitrarily inside each cluster by changing O(logn) variables at a time. 

Our proof of Theorem [1] easily extends to all uniquely extendable CSPs. 

Definition 5 ([8]). A constraint of arity k is uniquely extendable if for any value assignment to any fc — 1 
variables there is precisely one value for the unassigned variable that satisfies the constraint. 

Linear equations over GF(2) and unique games are the two most common examples of uniquely extendable 
(UE) CSPs, but many others exist (see, eg. [8]). Clearly, any instance of a UE CSP <I> is satisfiable iff its 
2-core is satisfiable. Thus, it is natural to define clusters analogously to XOR-SAT, i.e., two solutions are in 
the same cluster if and only if their 2-core restrictions differ only on flippable cycles. Our proof of Theorem[l] 
applies readily to any UE CSP, yielding a corresponding theorem, i.e., that, under this definition of clusters, 
satisfying assignments in different clusters arc 51(n)-disconnected (see the remark following Proposition I55| . 
However, we do not know whether the analogue of Theorem [2] holds under this definition of clusters, i.e., 
whether it is possible to travel between cycle-equivalent solutions in small steps. Also, note that while in 
XOR-SAT changing all the variables in any flippable cycle results in another solution, this is not necessarily 
the case for every UE CSP <&. 



2 Earlier work, LDPC codes, and variables of degree 2 

Consider a system of linear equations over GF(2), Ax ~ b. Note first that if the rows of A are hnearly 
independent then the system has solutions for every choice of 6, while if the rows are dependent, then it only 
has solutions for certain choices of b. It follows that the satisfiability threshold coincides with the threshold 
for the rows of A to be independent; if we are above that threshold, then it turns out that the rows are 
highly dependent and only a vanishingiy small proportion of the choices for b will yield a solution. 

One can use this observation to get an easy upper bound of m/n < 1 on the satisfiability threshold for 
random fc-XOR-SAT by noting that if m > n then the rows must be dependent. As one can imagine, though, 
this condition is not tight since variables of degree and 1 only contribute fictitious degrees of freedom. The 
next most reasonable necessary condition for satisfiability then is nic ^ 't-c, where nc,rnc is the number 
of variables and equations in the 2-core, respectively. In |^ Dubois and Mandler proved that this simple 
necessary condition is also sufficient by proving that the (exponential in expectation) number of solutions 
for the 2-core is strongly concentrated around its expectation. 

Random sparse systems of linear equations are the backbone of modern coding theory as they underlie 
Low Density Parity Check codes (LDPC). To form such a code one first decides on its block length (the 
number of variables n), its rate (via the number of equations m), the fraction, Ai, of variables of degree 
i, i.e., that will appear in i equations, and the fraction, pi, of equations that will bind i variables (unlike 
fc-XOR-SAT, different equations can bind different numbers of variables). Then, to get a concrete code, one 
chooses a uniformly random hypergraph satisfying these requirements using the configuration model (see 
Section [S]), and sets its adjacency matrix to be A. The codewords then are the solutions to Ax = 0, since 
for any useful set of parameters, the equations are w.h.p. linearly independent, and so every choice of b is 
equivalent, making b — a convenient choice. 

Returning momentarily to random fc-XOR-SAT we note that, by standard results, the 2-core of a random 
fc-uniform hypergraph is itself a random fc-uniform hypergraph conditional on its degree sequence. Moreover, 
w.h.p. the number of variables of each degree i > 2 in the 2-core is 7^ •n-l-o(n), where 7^ is a well- understood 
deterministic function [531 [T5]. Therefore, we see that the 2-core of a random fc-XOR-SAT instance is itself 
an LDPC code with A^ = 7^ and Pi = k for all i. 

To get an idea of how systems of random linear equations give rise to codes with good (typical) distance 
properties, it suffices to make two basic observations. 

• Say that cr is a solution to a system as above, e is an equation in the system and w is a variable in e. If 
we flip 1), in order to satisfy e we must now change (at least) one other variable u in e. Consider now 
any other equations containing u,v. In each of these, if we change u or v, at least one other variable 
must change. And so on. This propagation of forced changes stops only when all relevant loops close. 

• If we insist that every variable appears in at least 3 equations, i.e., has degree at least 3, then the 
bipartite (factor) graph between variables and equations is an excellent expander. In particular, it is 
easy to prove that after changing a single variable the loops mentioned above close only after another 
f2(?T,) variables have been changed (see, eg. J15j). 

Clearly, in an error-correcting code there should be no variables of degree or 1 since such variables do 
not carry any information about the message being sent (as any assignment to the remaining variables can 
be extended to them, they terminate any propagation sequence reaching them). 

At the same time, it is natural to guess that if the minimum degree is 2, then the chains of propagations 
above will all be sufficiently large, motivating the (incorrect) definition of fc-XOR-SAT clusters as the sets 
of solutions that agree on the 2-core. However, that guess is wrong. Degree 2 variables complicate matters 
by reducing the explosiveness of the propagation process, giving rise to a small number of pseudo-codewords 
(solutions near the vector, ruining the code's worst-case distance). Moreover, we must tolerate this, because 
having a non-trivial fraction of degree 2 variables is crucial in making LDPC codes rate-efficient [T7|. As we 
discuss in the next section, degree 2 variables also play a crucial and completely analogous role in the setting 
of this paper. Our fiippable cycles give rise to pseudocodewords and Theorem [T] establishes that they are the 
only source of flaws in the distance properties of this code. To prove this theorem, we develop machinery 



that can be applied to hypergraphs with arbitrary degree sequences of minimum degree 2. This yields a 
quantitative analysis of pseudocodewords which we will present in another paper. 

3 Proof Outline 

3.1 Theorem [it Cluster separation 

Given a satisfying assignment a, a flippable set is a set of variables S such that flipping the value of all 
variables in S yields another satisfying assignment r. Proving Theorem [1] boils down to proving that w.h.p. 
every flippable set in the 2-core other than a flippable cycle has linear size. 

A common approach to proving analogous statements is to establish, dcterministically, that every flippable 
set must induce a dense subgraph. In particular, if one can prove that for some constant e > 0, every such 
set is at least 1 + e times as dense as a flippable cycle, then standard arguments yield the desired conclusion. 
Here, though, this is not the case, due to the possibility of arbitrarily long 2-linked paths (see Definition 
[3T] below) . Specifically, it is easy to see that by replacing every edge of a fiippablc set by a long 2-linked 
path, one can create a new flippable set whose density is arbitrarily close to that of a flippable cycle. Thus, 
controlling the number and interactions of 2-linkcd paths, an approach sinfilar to that of [Tl I20j. is crucial 
to our argument. Carrying out this analysis on hypergraphs with a given degree sequence, as we do here, 
requires the introduction of a number of technical innovations. 

The key to controlling 2-linked paths is to bound a parameter governing the degree to which they tend 
to branch. Lemma [20l shows that this parameter is bounded below 1, so while arbitrarily long 2-linkcd paths 
will occur, their frequency decreases exponentially with their length. This enables us to tame them. 

We note that if we were working on hypergraphs with minimum degree at least 3, then there would be 
no 2-linked paths, and the proof would have been very easy. All of our innovations were designed to deal 
with the problem of degree 2 vertices and our approach readily applies to arbitrary degree sequences of 
minimum degree 2. In particular, since our analysis is developed for arbitrary degree sequences it provides 
a new tool to attack the issue of pseudocodewords in LDPC codes, which we do in a separate work. This is 
an important point since, as mentioned above, having a linear number of vertices of degree 2 is an essential 
requirement for any such code to approach channel capacity. 

3.2 Theorem [2t Connectivity inside clusters 

Recall that we can reach the r-core of a hypergraph by repeatedly removing any one vertex of degree less 
than r, until no such vertices remain. 

Definition 6. An r-stripping sequence is a sequence of vertices that can be deleted from a hypergraph, 
one- at- a- time, such that at the time of deletion, each vertex has degree less than r. 

It is often useful to consider stripping the vertices in several rounds. 

Definition 7. The parallel r-stripping process consists of iteratively removing all vertices of degree less than 
r at once along with any hyperedges containing any of those vertices, until no vertices of degree less than r 
remain. 

It is easy to show using standard facts about r-corcs of random hypergraphs that for every constant 
e > 0, there is a constant T = T(e) such that in a random /c-uniform hypergraph, all but en of the non-core 
vertices can be removed by a stripping sequence of length at most T. What is significantly harder, and our 
main technical contribution in order to establish Theorem [21 is proving that w.h.p. all non-core vertices can 
be removed by a stripping sequence of length O(logn). (See Section 3] for a description of why this suffices 
to prove Theorem [21) We note that this result is of independent interest in random hypergraph theory and 
for that reason we prove it for arbitrary r > 2, even though we only need the case r = 2 case for Theorem [21 

To prove that all non-core vertices can be removed by a stripping sequence of length O(logn), our 
approach is significantly different below and above the threshold, c^ ,. for the emergence of an r-corc in 



random fc-uniform hypergraphs. In both cases, we begin by stripping down to Hb for a sufficiently large 
constant B. A simple argument shows that for any non-core vertex w, the number of vertices removed during 
this initial phase that are relevant to the removal of v, is bounded. What remains is to show that every 
non-core vertex in Hb can be removed from Hb by a stripping sequence of length O(logn). 

For c < c^^, we prove that there exists a sufficiently large constant B = B{c,k,r) such that after B 
rounds of the parallel r-stripping process, all connected components of the remaining hypcrgraph Hb have 
size at most W ~ 0(logri,); therefore, all remaining vertices can be removed with an additional W strips. 
To do this we establish analytic expansions for the degree sequence of Hb as B grows and then apply a 
hypergraph extension of the main result of MoUoy and Reed |19] regarding the component sizes of a random 
fc-uniform hypergraph with a given degree sequence. 

For c > cj ^, a lot more work is required. Once again, 2-linked paths are a major problem. Indeed, it is 
not hard to see that a long 2-linked path with one endpoint of degree 1, can create a long stripping sequence 
leading to the removal of its other endpoint. 

We first establish that for any e > 0, after a finite number B — B{e) of r-stripping rounds we are 
sufficiently close to the r-core for two important properties to hold in the remaining hypergraph Hb- (i) 
there are at most en vertices of degree less than r, and (ii) the "branching" parameter for 2-linked paths, 
mentioned above, is bounded below 1. Property (ii) allows us to control long 2-linked paths. However, this 
does not suffice as we need to control, more generally, for large tree-like stripping sequences. To do so, we 
note that any large tree must either have many leaves, or long paths of degree 2 vertices. Such long paths 
will correspond to 2-linked paths in the random hypergraphs, and so (ii) allows us to control the latter case. 
Leaves of the tree will have degree less than r, and so (i) enables us to control the former case. 

4 Selecting a basis for intra-cluster travel 

Given an r-stripping sequence that removes every vertex outside of the r-core, we associate with it a directed 
graph as follows: (i) let v be the next vertex in the sequence; (ii) remove the (no more than r — 1) edges Ey 
containing v; (iii) add arcs to v from every vertex other than v in Ey. Note that if Ey contains a variable 
w ^ V oi current degree 1, w will end up having in-degree 0, as the edges in Ey are erased by the processing 
of V. More generally, no arcs are associated with the removals of degree variables. So the digraph will 
include all vertices that are not in the r-core, as well as any r-core vertex that shares an edge with a vertex 
not in the r-core. 

For every vertex v in the digraph, we define R'^{v) to be the set of vertices that can be reached from v. 
The following lemma is the heart of our proof of Theorem [5] and its proof occupies Sections [7] and [51 after 
we set out some basic facts about cores in Section [5] and some basic calculations in Section [51 

Lemma 8. Fix integers k,r and let H ~ Hk{n,p) be a random k-uniform hypergraph on n vertices with 
p = c/n . There exists a constant Q > such that w.h.p. there is an r-stripping sequence of H culminating 
with the r-core such that in the digraph D associated with the sequence: 

(a) For every vertex v, \R^{v)\ < Qlogn. 

(h) For r — 2, for every fiippable cycle C , 



Y.\R+{v)\< Qlogn 



vec 

We note that, via standard arguments, a number of earlier works implicitly establish that for any fixed 
6* > 0, there exists Z = Z{9) such that |_R+(u)| < Z for all but On vertices v. The difficulty here lies in 
proving a bound on |i?+(w)| that holds for all vertices. 

Remark: For our purposes, we only need Lemma |S] for the case r = 2, fc > 3. However, this lemma is of 
independent interest to the study of random hypergraphs, and so we prove it for all r, fc > 2. 

Remark: The proof of Lemma [5] can be extended to show that w.h.p. for every vertex v £ D, the 
subgraph induced by |i?^(w)| has at most as many arcs as vertices. 



Proof of Theorem [H Given an arbitrary system of linear equations consider any 2-stripping sequence w i , . . . , wj , 
of its associated hypergraph which strips all the way to the 2-core. Let D be the digraph associated with 
the sequence. Let ei, 62, . . . , et be the corresponding sequence of removed hypcrcdgcs, i.e. e^ was the only 
edge containing Vi when Vi was removed. Let B be the set of non-core vertices with indegree zero in D. 

We will perform Gaussian elimination in an manner such that for any satisfying 2-corc assignment, cr, 
B is a basis for the subspacc formed by all satisfying extensions of a. Specifically, we consider the removed 
hyperedges in reverse removal order, i.e., et, et-i, . . . , ei. Each time we consider a hyperedge e^, we express 
Vi in terms of the other variables in e,;. Each of these other variables is either in B or in the 2-core, or is 
already expressed in terms of variables in B and the 2-core. So, in the end, each non-core variable u ^ B is 
expressed in the form u = X^tueyfu) ^' '^here xl"") is a set containing variables from the 2-core and B that 
have a path to u in D. 

More specifically, for each v E B and for each v in the 2-core, we set x{v) = {v}. Every non-core variable 
not in B has indegree at least 1 and so is Vi for some removed hyperedge e^. For each other u € e,;, either 
u £ B, OT u is in the 2-core, or m = Vj for some j > i. When processing the edges in reverse removal order, 
i.e. et, . . . , ei; to process Cj we set x('^i) to the disjoint union of the sets xl'"); O'^sr all u G ei other than Vi. 
That is, a variable z € x(j'i) iff -^ S xC"*^) for an odd number of variables u G e^ other than Vi. Since e^ is the 
equation Vi = J2u&ei-ujtv ""■> ^y induction, Vi = J2w£v(vi) '"'• Note also that, by induction, x(^i) contains only 
vertices that are in B or the 2-core. Finally, note that possibly xi^^i) = 0; in that case, Vi = J2wexivi) ^ ~ 
in every satisfying assignment. 

Remark: It is not hard to adapt the proof of Lemma [5]to show that w.h.p. for every i, x(wi) 7^ 0- But 
that is not required for the purposes of this paper. 

If at this point we fix any satisfying assignment a to the 2-core variables, the variables in B will remain 
unrestricted. Moreover, since all non-core variables not in B are expressed in terms of some subset of 
variables in the union of the 2-core and B, the variables in B form a basis for the subspace corresponding 
to all possible extensions of cr. 

Since i? is a basis, there are exactly 21^' extensions of a to the non-core variables, one for each assignment 
to B. We can move between any two such extensions by changing the assignments to the variables of B, 
one-at-a-time. Each time we change a variable v £ B, in order to get to another satisfying assignment, we 
only need to change a subset of i?+(w) in the digraph D, because only variables u G i?+(w) can have v S x(")- 
Thus, by Lemma |8l^a), we can move between any two such solutions changing at most Qlogn variables at a 
time. 

To complete the proof of Theorem [51 we show that we can move between any two cycle-equivalent 
solutions to the 2-core by changing a small number of variables. We move between any two such solutions 
by switching one flippable cycle at a time. Switching a flippable cycle, S, may require also changing some 
non-core variables in Utig5-R^(w). By Lemma[8jb), this requires switching at most Qlogn variables. D 

4.1 Frozen variables 

We close this section by showing how the digraph D can also be used to determine all of the frozen variables. 
Recall that a variable is said to be frozen in a cluster, if it takes the same value in all assignments of the 
cluster. In general, e.g., in random fc-SAT, the set of frozen variables can differ from cluster to cluster. In 
random fc-XOR-SAT, though, the set of frozen variables depends only on the underlying hypergraph, i.e., 
is the same for all clusters. Specifically, the 2-core variables not in fiippable cycles are frozen by definition, 
whereas all 2-core variables in fiippable cycles are not frozen. Below, we determine which non-core variables 
are also frozen and, we will see, the answer is independent of the 2-core assignment. 

To obtain a basis for all core-solutions, prior to seeking a basis for the extensions wc form a basis for 
the possible assignments to the fiippable cycles. To do this, we first choose, for each flippable cycle C, one 
variable vq arbitrarily and use |C| — 1 edges of the cycle to eliminate all variables in C other than vc- Thus, 
for every variable v € C other than vc, we have v = vc + z, where z G {0, 1} depends on the cluster; we 
set x(^) = {f^c}- Clearly, for any cluster, the set of chosen variables forms a basis S for the subspace of its 
core solutions. Next, we apply the method, from the proof of Theorem [21 for finding a basis for the set of 



all extensions. The result is a set B containing S and non-core variables. 

Theorem 3. In every cluster, the frozen variables consist of the 2-core variables not in flippahle cycles, and 
the non-core variables whose set x does not contain any variable from, B. 

Proof. It suffices to prove that B is a basis for all solutions in the cluster. This follows trivially from the 
fact that S" is a basis for the core solutions of the cluster, and the fact that our procedure in the proof of 
Theorem [2] yields a basis for all possible extensions of any 2-corc solution. D 

5 Random hypergraphs and their cores 

We will use the configuration model of Bollobas [1] to generate a random fc-uniform hypcrgraph H with a 
given degree sequence. Suppose we are given the degree d{v) for each vertex f ; thus ^ d{v) = kE where E 
is the number of hyperedges. We take d{v) copies of each v, and we take a uniformly random partition of 
these kE vertex-copies into E sets of size k. This naturally yields a fc-uniform hypcrgraph, by mapping each 
fc-set to a hyperedge on the vertices whose copies are in the fc-set. Note that the hypcrgraph may contain 
loops (two copies of the same vertex in one hyperedge) and multiple edges (two identical hyperedges). It 
is well-known that the probability that this partition yields a simple hypcrgraph (i.e., one with no loops or 
multiple edges) is bounded below by a constant for degree sequenceq^ satisfying certain conditions: 

Definition 9. Say that a degree sequence S is nice if E = 8(n) and ^^, d{v){d{v) — 1) = 0{n). 

Every degree sequence we will consider will correspond to some subgraph of Hk{n,p) with a linear 
expected number of edges. Since, as is well-known, the degree sequence of such random hypergraphs is nice 
w.h.p., all the degree sequences we will consider will be nice. With this in mind, we will make heavy use of 
the following standard proposition and corollary as working in the configuration model is technically much 
easier than working with uniformly random hypergraphs with a given degree sequence. 

Proposition 10. If S is a nice degree sequence, then there exists e > such that the probability that a 
random hypcrgraph with degree sequence S in the configuration model is simple is at least e. 

This immediately yields: 

Corollary 11. If S is a nice degree sequence then: 

• If property Q holds w.h.p. for k-uniform hypergraphs with degree sequence S in the configuration model, 
then Q holds w.h.p. for uniformly random simple hypergraphs with degree sequence S . 

• For any random variable X, if E{X) — 0(1) for k-uniform hypergraphs with degree sequence S in the 
configuration model, then E{X) = 0(1) for uniformly random simple hypergraphs with degree sequence 
S. 

The following lemma will be very useful. Its exponential term is not tight, but will suffice for our purposes. 

Lemma 12. Consider a random k-uniform configuration with E edges, i.e., with total degree kE. For each 
i = 2,...,k, specify £i sets of i vertex-copies, and set L = X]i=2^j- '^^^ probability that each of these sets 
appears in some hyperedge, and no two appear in the same hyperedge is less than 



( ^^' A A /(fc-i)(^-2)...(fc-i + i)\ 

-p (^^^ j n y j^Er^ j 



^Clearly, wc are referring to a sequence of degree sequences Sn so that asymptotic statements are meaningful. We suppress 
this point though, throughout, to streamhne exposition. 



Proof. We choose the hyperedges of the configuration by processing the specified sets one-at-a-time. To 
process one of the £i sets of size i, we first choose one set member 7 arbitrarily and then randomly select 
the remaining fc — 1 vertex-copies of the hypercdgc containing 7. Every time we do this there are at least 
kE — kL yet unsclectcd vertex-copies. Thus, the probability we chose all other i — 1 members of the specified 
set is at most 

{k-l){k-2)...{k^i + l) {k-l){k^2)...{k-i + l) f E ^ '"^ 



{kE-kLf-^ {kEy-^ \E-L 

< 



(fc-l)(fc-2)...(fc-Z + f) kL/iE^L) 

(kEY~^ ^ 



since i < k. So the probability that all L tuples are chosen to be in a hyperedge is less than 

' / (fc-f)(fc-2)...(fc-. + l) y- J,L/iE^L))i, kLViE-L) , A / (fc-l)(fc-2)...(fc-Z + f) 



n 



5.1 Cores 

It is well-known that the r-core of a random fc-uniform hypergraph is uniformly random conditional on its 
degree sequence. See [52] for the case fc = 2, and [l^ for the nearly identical proof for general r. In fact, 
the same is true of the graph remaining after any number of iterations of the parallel stripping process. It 
is also straightforward (see e.g., jTB]) to show the following propositions. 

Proposition 13. Let H = Hk{n,p) be a random k-uniform hypergraph and let H = Hq, Hi, . . . be the 
sequence of hypergraphs produced by the parallel stripping process. 

(a) For every i > 0, Hi is uniformly random with respect to its degree sequence. 

(b) There exist functions po,pi, ■ ■ . such that for any fixed integer s, w.h.p. Hg contains pj{s)n + o{n) 
vertices of degree j and -i:(^,^-^^ jpj{s))n + o{n) edges. 

The following is a bit stronger than Proposition [T^ a') . 

Proposition 14. Let H = Hk{n,p) and for any i > 0, let Hi be the hypergraph produced by i rounds of the 
parallel stripping process. Expose V(Hi), and for each < j < r — I, expose Vj , the set of vertices of degree 
j in Hi. Thus V>r ~ ViHi) — U^^qV; is the set of vertices of degree at least r in Hi. Finally, expose E, the 
number of edges in Hi. The hypergraph Hi is uniformly random conditional on these parameters. 

Proof. Consider any two hypergraphs A, A' on V{Hi) which satisfy these parameters. Consider any hyper- 
graph H such that, if we apply the parallel stripping process for i rounds, then Hi = A. Create H' by 
replacing A, the graph induced by V{Hi), with A' . Since A, A' agree on Vq, ..., K_i, it is easy to see that 
applying the parallel stripping process to H' for i rounds will leave H'^ = A' . Furthermore, H, H' are both 
equally likely to be chosen as Hk(n,p), since they have the same number of edges. Thus A, A' are equally 
likely to he Hi. D 

Proposition [T3l allows us to use the configuration model to study Hi. We will begin by bounding the 
contribution of the highest degree vertices to the sum of the degrees. 

Lemma 15. For every 9 > there exists J = J{9) such that for all i, 

J 
51 deg^^(w) -^jPjii) < dn 

v€Hi j=l 
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Proof. By Proposition [T31 the difference is o{n) + J2vdcg {i.)>J '^^S-Hi(^')- Since Hi C Hq = H, tliis sum is 
bounded by the corresponding sum in H, i.e., X^jj-dcg iv)>j '^^Sni'^)- The fact that this sum can be made 
arbitrarily smaller than On for any > by taking J = J{9) sufficiently large, is well-known and follows from 
the facts that (i) for each constant i, the number of vertices of degree j in 7J is w.h.p. Xin+o{n) for a particular 
^i = Ai(c, k) and (ii) the number of hyperedges in H is highly concentrated around (^ X]i>i ^-^i)"-- I— ' 

The following similar bound will also be useful: 
Lemma 16. For every constant d and 6 > there exists J = J{d, 9) such that for all i, 



V- degg,(t;)! ^p j! 

^ {AcgH.{v) - d)\ ^ij-dyf'^'' 



< en 



Proof. The proof is almost identical to that of Lemma [15] but exploits the concentration of the number of 
d-stars in H, rather than of the number of edges. (Recall that a d-star is a set of d edges which contain a 
common vertex.) The concentration of the number of d-stars in H is easily established, e.g., by the Second 
Moment Method or Talagrand's Inequality. (Indeed, Lemma[T5]and its proof are special cases of this lemma 
and its proof for d ~ 1.) D 

For any fixed integers fc, r and real number A > 0, we write 

^rW - e-' J2 ^'A! and /fc,.(A) = /(A) = :^7t^ • 

i>r~l ^^ ' 

See [ISl [131 [ini [H] for proofs that the threshold for the appearance of an r-core in a random fc-uniform 
hypergraph Hk{n,p) with p = c/n''~^ is 

4,r = min/fe.^(A). 

We will see that /' has a unique and, thus, for c > c^. ^ the equation /(A) ~ c has two solutions. 

Definition 17. For c > c^. ^, let fi = ^i{c) denote the larger of the two solutions of f{X) = c. 

The following two propositions arc well-known; sec eg. |18j for proofs. 

Proposition 18. For every fixed j > r, w.h.p. the r-core contains {e^'^^^ /j\)n + o{n) vertices of degree j. 
Furthermore, w.h.p. the r-core contains {p/k)'^r{p)n + o{n) edges. 

Proposition 19. For every 6 > 0, there exists B — B{9) such that w.h.p. 

(a) Hb contains fewer than On vertices not in the r-core; 

(h) For each j > r, \pj{B) - e-^'pL^ /i\\ < 9. 
The following lemma will be critical for our analysis. 
Lemma 20. For every c > c^. ^, there exists C = ({k,r,c) > such that 

^ ' i>r-l 

where fi is the largest of the two roots of the equation fk,r{X) — c. 
Proof. 

/'(A)=0 ^ *.(A) = A(fc-l)*,(A)'^-^*;(A) ^ ^ ^ = (fc_i)^::_ . (2) 

Equation ([2]) yields c^ ^ = /(A*) for some A* satisfying the last equation in ([2]). For c > c*f^ ^, since /i = fj.{c) 
is the largest of the two roots of /(A) = c, it follows that /i > A*. The lemma now follows by noting that the 
RHS of ([!]) divided by the LHS is proportional to J2i>r-i ^^~^i — ' '^hich is clearly increasing with /.i. D 
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6 Preliminaries to the proof of Lemma [8] 

As we said above, we will choose a sufSciently large constant B, strip down to Hb, and then focus on 
R^{u) n Hb, making use of the fact that Hb is very close to the 2-core (by Proposition [TO)) . The following 
will be used to bound the number of vertices that are removed from i?+ (u) when stripping down to Hb ■ For 
integer s > 0, we use N'^{v) to denote the s-th neighborhood of w, i.e., the set of vertices within distance s 
from V. For any set of vertices A, N''{A) = IJuga N''{v). We consider a single vertex to be a connected set. 
A straightforward induction yields the following. 

Proposition 21. For any integer i and vertex u G Hi, R^{u) C N^{R^{u) n Hi). 

Lemma 22. For any c, s > 0, there exists T = r(c, s) such that in a random graph G{n,p) with p = c/n, 
w.h.p. for every connected subset A of vertices \N''(A)\ < r(|yl| + \ogn). 

Proof. We prove this for the case s — 1, i.e., that there is a constant 7 > 1 such that w.h.p. every connected 
subset of vertices A satisfies |iV(^)| < 7(|A| + logn). By iterating, we obtain that for every s > 1. every 
connected subset of vertices A satisfies |Af*(^)| < /s(|y4|) where 

fii^) = j{x + logn) 
fi+i{x) = j{fi{x)+\ogn), ioT i>l. 

A simple induction yields fi (x) < 7* (a; + i log n) and that yields the lemma with F — sj'^ . 

Given any set A of size a, the probability that A is connected is at most the expected number of spanning 
trees which is a'^~^(c/n)"~^. After conditioning that A is connected, the number of neighbors outside of A 
is distributed as Bin(a(ri — a), c/n). The probability that this exceeds z is at most 

For any 7 > 2, if |A^(A)| > j{\A\ + logn), then we must have |A^(A)\A| > ^7(1^1 + logn). Taking 7 > 4ec, 
the expected number of connected sets A satisfying this last inequality is at most 

aj \nJ a^ a^ \2i/^J 

for 7 sufficiently large. Multiplying by the n choices for a yields the lemma. D 

Lemma 23. Fix fc > 3 and let H = Hk{n,p) be a random k-uniform hypergraph with p = c/n , where 
c > c^ 2- '^^^ expected number of vertices in flippable cycles in the 2-core of H is 0{V). 

Proof. Let T) be the degree sequence of the 2-core of H . By Corollary [Til we can work in the configuration 
model. Recalling Definition [TTl Proposition [TSI and Lemma [201 w.h.p. 

(i) P has total degree 771 + o(n), where 7 = /i^r(M), 

(ii) V has A2n + o{n) vertices of degree 2, where A2 = e~^/i^/2, 

(iii) there exists C > such that 2(fc — 1)A2 < (1 — C)7- 

We first bound the expected number of flippable cycles of size a in the 2-corc. Let A = 7^ + o{n) be the 
total number of vertex copies, and let L = X2n + o{n) be the number of copies of degree 2 vertices. 

There are (^) choices for the connecting vertices, ^°~ '' ways to order them into a cycle, and 2° ways to 
align their vertex-copies. This yields a pairs {j/i, zi}, ...{j/a, z^} of vertex copies, each of which must land in 
a hyperedge. We process these pairs one-at-a-time, halting if we ever find that the pair does not land in a 
hyperedge. To process pair i, we ask only whether Zi lands in the same hyperedge as y^; if it does we do not 
expose the other vertex-copies in that hyperedge. Thus, prior to processing pair i, we have exposed exactly 
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2i — 2 vertex-copies, all of degree 2. There are fc — 1 other copies appearing in the same hyperedge as y^. 
Each of the A— (2i — 1) unexposed copies (not including i/i) is equally likely to be one of those copies (and, 
for A: > 3, the exposed copies also have positive probability). So the probability that Zi is one of them is at 
most (fc — 1)/(A — 2i + 1). So the expected number of flippable cycles of length a is at most: 

L\ (a-l)!,a A fc-1 ^ i A 2(fc - 1)(L - ,: + 1) 

9 ^ li A -%-^1 "^ 2ali 



A-2i + l 2'^ J--"- A-2i + l 



1=1 

By condition (iii) above, 2(fc- 1)L/(A- 1) < 1-iC, and so 2(fc- l)(L-i + 1)/(A- 2i + l) < 1 - ^C for each 
i, since L < i(A — 1). So the expected number is at most ^(1 — ^C)"; a-nd so the expected total number of 
vertices on flippable cycles is at most i J2a>ii^ ~ ^C)" = 0(1). D 

7 Proof of Lemma [8] above the r-core threshold 

Let H — Hk{n,p) be a random fc-uniform hypergraph with p — c/n^^^ and let H — Ho,Hi, ... be the 
sequence of hypergraphs produced by the parallel r-stripping process. We will choose a stripping sequence 
that is consistent with the parallel process; i.e., in our stripping sequence: for every i < j, the vertices 
deleted in round i of the parallel process come before the vertices deleted in round j of the parallel process. 
Let D be the digraph associated with this r-stripping sequence for H and recall that R'^{u) denotes the 
set of vertices reachable from a vertex u in D. 

7.1 Proof of part (a) 

Our main challenge is to prove the following lemma. The idea is that we will take B large enough so that by 
stripping down to Hb, Proposition 1191 gives us control of the degree sequence that remains, and Lemma [20l 
allows us to prove that a certain branching process involving long paths in a graph constructed from Hb 
dies out. 

Lemma 24. For every c> c^. ^ there exists B = B(c) and Q ~ Q{B, c) such that w.h.p. for every vertex u, 
\R+{u)r}HB\<Q\ogn. 

Proof of Lemma[Sj(a). Consider any vertex u. If u ^ Hb, then by Proposition 12 1[ R'^{u) C N^{u) in which 
case Lemma [22] immediately implies that |i?+(ii)| < r(l + logn) for some constant P ~ P(c,_B). 

Hue Hb, then R+{u) C N^{R+{u) n i^ij), by Proposition EH Since, by Lemma [Ml |-R+(u) ^ Hb\ < 
Qlogn, Lemma [221 now implies that |i?+(u)| < P(Qlogn -I- logri) = Zlogn for Z = FQ -I- 1 = Z{c,B). D 

For any i, we define Di to be the subdigraph of D induced by the vertices in Hi. We also define: 

Definition 25. For any vertex u € Di, let Si{u) denote R^{u) D Hi. 

Let T be a BFS tree in Di rooted at u, thus spanning the vertices of Si{u). Since T is a BFS tree, each 
vertex has indcgrce at most 1 in T implying that no two arcs of T were formed during the removal of the 
same hyperedge. From this point on, we will treat T as an undirected graph. 

The following technical lemma bounds the density of small subgraphs of Hk {n,p). Lemmas of this flavour 
arc very common in random graph theory. Given a subset S of the vertices of Hk{n,p), we let ij{S) denote 
the number of hyperedges that contain exactly j of the vertices of S, and we let L{S) ~ X]i=2(j ^ ^)^j- 

Lemma 26. For every C > 0, there is 6 > 0, such that w.h.p. every S with \S\ < 6n has L(S) < (1 + C)l'5'|- 

Proof. Rather than working in the Hk{n,p) model with p ~ c/n^^^ , where the expected number of edges is 
{c/k\)n + o{n), it will be convenient to work in the H^ {n, m) model, where exactly m ~ {c/k\)n edges are 
selected uniformly, independently and with replacement. Since m = 0{n), standard arguments imply that 
high probability properties in this model transfer to the Hf;[n,p) model. 
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Let Ya = YaiCj denote the number of sets S with \S\ = a and L{S) = (1 + 0\S\. We will bound E(F„) 
as follows. Define 

Choose a vertices and some {£2, ■■■,ik) € ^a-, pick ij edges for each j, and then multiply by the probability 
that each edge chooses (at least) the appropriate number of vertices from S. This yields 



-(-.) ^ (:) E n (") [0 {^r 



< I — 1 > I — I I I — — — , for some constant J = J(c,fc) > 



< It) u n e4t 



="£ 



/A \ ''° 

= ( I , for some constant A = A(c, fc, > 

\ n J 

Choosing 9 = 2^, it is standard and straightforward to show E {J2a<en ^a) ~ "(!)• ^ 

The following key lemma will allow us to bound the expected number of large trees T. 
Lemma 27. For every vertex u G Di, if T is the BFS tree in D^ rooted at u, then 

• For every i, 

(a) For every v € Si{u), v ^ u we have deg^. (u) < degg.i.^\{v) + r — 2. 

(b) For every leaf v ^ u of T, deg^. (v) < deg'ji(w) + r — 2. 

• For any 6 > 0, we can choose i ~ i(5) such that 

(c) w.h.p. degg/^\{v) = degrp{v) for all but at most S\Si{u)\ + 2 vertices v € Si{u). 

Proof. 

Part (a): Consider v ^ u. At the point in the stripping sequence that v will be removed, it will have 
degree at most r — 1. One of those r — 1 edges is in Si{u), since v has indegree 1. Every other edge of Hi 
containing v will be removed before v, therefore resulting in a directed edge out of v in Si{u). 

Part (b): Since u is a leaf in T, and v y^ u, v was initially dclctablc. Thus, it has degree at most r — 1 in 
Hi and it has degree at least 1 in T. 

Part (c): For 2 < j < k, let £j denote the number of hyperedges with j vertices in Si{u). All vertices 
of Si(u), except possibly u itself, are not in the r-core. So, by Lemma HQf a). we know that for any 9 > 
we can select i = i(9) such that |S'i(M)| < 9n. If we pick 9 sufficiently small in terms of 5, then Lemma [^ 
implies that w.h.p., EL20' ^ ^)^o < (1 + ^l'^)\Si{u)\. So 



k k 

E 

v&Si{u) j=2 3=2 



dcgs.(.)(«) - E J'^^- ^ 2E(J - 1)^J- < (2 + 5)1^.(^)1. 
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Now the total T-degree of the vertices m T is 2|S'i(M)| — 2. So 

E dcgs,M(«) - degT(«) < (2 + -5)1^.(^)1 - {2\S,{u)\ - 2) = 5\S,{u)\ + 2. 

v£S,(u) 

This proves part (c). D 

In the following we will need to take B sufficiently large for various bounds to hold. Let Xa = Xa{B) 
be the number of BFS trees T in Db with a vertices. Since we only need to show that there exists some 
constant Q > Q such that w.h.p. Xa = for a > Qlogn, in the following we will allow ourselves to assume 
that a is greater than some sufhciently large constant. 

To prove Lemma [^ we first observe that, by Proposition ll91 we can assume that Hb is uniformly random 
conditional on its degree sequence. Since Lenima [24l asserts a property to hold with high probability, it suffices 
to establish this property in the configuration model for Hb- Moreover, recall that by Proposition [TW b') as 
B is increased, w.h.p. the degree sequence of Hb tends to that of the core. 

Let vi, ...jVa be the vertices of T. We first specify di = degrp{vi) for each i, noting that these degrees 
must sum to 2a — 2. The number of ways to arrange these a vertices into a tree with a specified degree 
sequence is (a — 2)!/ Yl{di — 1)! and there are a choices for the root, u, of the tree. 

Next we choose the vertices of T. Then for each edge of T, we choose a vertex-copy of each of its 
endpoints. To do so, for each Vi, we choose a copy of Vi for each of the di edges in T incident with Vi. If 
'^'^§^13(^0 = J' then there are j!/(j — di)! choices for the di copies of Vi. Since deg^^(wi) > di, the number 
of choices corresponding to Vi is at most J^wdeg (w)>di '^6S_f/B(''^)V('^'^g_ffB(^) ~ '^V-- Applying Lemma [TBI 
with 9 — yS, and replacing J with 00, we obtain that this number is at most (Y{di) + v5)n where 

Y{d) = YB{d) = Y^-^^p,{B) . 
j>d ^^ '' 



Note that this bound holds uniformly, even when di grows with n. 

Furthermore, if deg5^(„)(ui) ~ dcgrp(vi), and if Ui 7^ u, then by Lemmal^Tla). di < deg^^(w) < di 
So in this case we can use Y'{di) rather than Y{di) where 



-r- 



j=d ^■' '' 

Using Y'{di) instead of Y{di) will be particularly useful when di < 2. 

By Lemma \T7\c) . for any S > we can take B = B{5) > sufficiently large, so that we must use Y{d) 
for at most 5a + 2 vertices Vi 7^ u, none of which have dcg^^ = 1 (since if degu^ = 1, by Lemma I^Tl b). we 
can use Y'). For convenience, we will assume a > 2/S so we can take 6a + 2 < 25a. Then we overcount by 
using Y{d) for exactly 25a degree 2 vertices, even if the number for which it is required is smaller. 

We will upper bound E(Xa) by using Y{d) for u and for every vertex Vi with di > 3. Let ii,i2,^3 
denote the number of vertices Vi for which di ^ 1, di = 2, di > 3, respectively. We note that {Y{degrp{u)) + 
V^)/(r'(deg-r(w)) + V^) < (y(l) + V5)/(Y'{1) + v^). Also, for sufficiently large rf, Y{d) is decreasing and 
so there is a constant d* such that for all d > 3, Y{d) < Y{d*). So, if we were to use Y'{2) for every degree 
2 vertex, then the overall contribution of the Y, Y' terms would be at most 

^^±^[{Y\l) + V5)nr^ ■ [{Y'{2) + V~5)nr^ ■ [{Y{d*) + ^/^)r.]*^ . 

However, to account for the 25a vertices for which we use Y{2), we multiply by the (j'l^j) < {,25a) choices 
for those vertices, and we multiply by T where 

^_,y{2) Y{2) + V~5 
Y'{2) Y'{2) + V~5 ' 
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for S sufficiently small. This brings the overall contribution of the Y,Y' terms to at most: 

= 0(n)[(r'(f) + V^)n]*i-if " ]T^^''[{Y\2) + VS)nY--[{Y{d*) + VS)nY\ 

\2oaJ 

Finally, we multiply by the probability that each of the a — 1 pairs of vertex-copies corresponding to 
edges of T, lands in a hyperedgc of the configuration and divide by the a! rearrangements of the vertices. 
Because T is a BFS tree rooted at u, each vertex has indcgrce in T at most 1. Thus, no two edges of T were 
formed from the same hyperedge. So, wc can apply Lemma [T2l to the a — 1 specified pairs of vertex-copies 
and multiply by (■^^) e'^° /(E-a) ^q gg^ ^j^ overall bound, where E is the number of edges in Hb- 

Recall that for c > d^ j., ^ = fj,{c) denotes the larger of the two solutions of /(A) = c. By Proposition [T9l 
and Lemma [TSl for any 5 > 0, we can take B sufficiently large so that 



kE-^i Y. 



e-^/x^ 



j>r-l 



< Sn 



Our key Lemma [501 now yields that by taking B sufficiently large, we can have S sufficiently small in 
terms of C that various bounds below hold, including 



^'^'^^ •s]t^<i^i 



(r-2)! 



kE/n 



(3) 



By Lemmata [16] and [TH for any J > 0, we can take B sufficiently large so that Y'{1) < S/2 and 

Y'{2) < ^^ + 5/2. So, y'(l) + VS,Y'{2) + VS are bounded above by 6 and f^^ + S, respectively. We 

let * = 2Y{d*) > Y{d*) + V5i0T6 sufficiently smaU. 
Putting all this together yields 



E{Xa) < 0{n)[^^^jT'"^ 



kE 



^ka^/iE-a) 



< 0(n/a)e'^-"'/(^-") 



di + ...+da=2a-2 

1 






+ S]n 



(*n)* 



a(a-2)! 



«!n:=i(rf.-i) 



Y2<5a f 2_ 



1 



(2<5)2'5(l-2(5)i-2'5; ^ \kE/n 



E 



di + ...+da=2a-2 



(7^2)! 



,(25)2'5(l-2(5)l-2<5 

For 6 sufficiently small in terms of (, 



kE/n 



E 



di+...+da=2a-2 



(7^2)! 



\I/' 



(2,5)2<5(l-2(5)i-2'5 ^ "^+ 10 ■ 
Since this is the degree sequence of a tree, we have ii > is, and so since 5 < \, 



E{Xa) < 0{n/a)e''-'/^''--^(l + ^) 



E 



i + ...+da=2a-2 



v^. 



fc-1 

kE/n 



(7^2)! 



k-l 

kE/n 



\/^* 



fc-1 

kE/n 



(4) 



v]/* 
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We choose S sufficiently small in terms of C so that 



kE/n' kE/n 100 



This and © yield 



^ \ a / ^\ *2 / ^ \ a—t2 

EiX.) < OKa)e-^/(-^) ( 1 + i^ ) ^ (l - (^ 

^ di+...+da=2a~2 ^ / \ ' 



Now we fix ^2 and count the number of choices for di, ...,da. There arc (^^j choices for the values of i 
with di = 2. The remaining a — t2 degrees sum to 2a — 2 — 2^2- The number of choices for sequences of 
y non-negative integers that sum to z is (^ _7 ) ' ®^ ^^^ number of choices for these degrees is bounded by 

C^a-u-'i'') < 22('^-*=)-3 < 4'^-*^ Thus, 



E{x^) < oK«)e'^-v(^"'^) (i+^yt: (;J4-*^ (^1 _ p^v C 



£2=0 



t2 / /- \ a—t2 

100, 



OKa)e'^-'/(^-'') fl + 4V fl - ^ + :f 



10 



2 25 



< 0(n/a)e'^-'/(^-'') ("l - ^ y 



< 0(?i/a) 1 
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(5) 



a) is sufficiently small that 
logn^a) = 0(1) a'^d, therefore, 



where the last inequality holds for all a such that ka/{E — a) = ka/{pn + o(n) 

gfca/(B-a) < 1 + |. Thus, there are constants Q,^ > such that ^.{YT 

w.h.p. there are no trees of size between Q logn and ^n. Note now that ^ depends only on C, which, in turn, 

depends only on c,r. Therefore, we can always select (5 < ^. RecalHng that, using Proposition [TOla). we 

chose B large enough that Hb contains fewer than 5n vertices outside of the r-core, this implies that there 

are no trees of size at least ^n. 

D 



7.2 Proof of part (b) 

Consider any flippable cycle C with vertices ui, ..., u^. In our directed graph Z?, add edges from Uj to Uj+i 
for each j (addition mod t). Thus, R^{ui) = U^^]^i?'^(uj). We modify the arguments from the proof of part 
(a) for this setting. 

Again, we let T be a BFS tree, this time rooted at Mi. Note that since each Uj has degree exactly 2 in 
the 2-core, it follows that for degj:^^(uj) = Aegg^{uj) for each j, and for j = 1,£, degg^{uj) = degrp{uj) + 1. 
Thus Lemma I^TT a) still holds. The only members of C that can be leaves of T are ui, U£, so at most one leaf 
of T violates Lemma [TTT b). Finally, by Lemma [^ we can restrict our attention to ^ = o{n). Since ui, ..., ui 
are the only 2-core vertices in Sb{Ui), we can maintain [^^(ui)! < 9n and the proof of Lemma [27rc) still 
holds. 

As in part(a), we bound the expected number of such trees of size a; ui is the root and hence plays the 
role of u from part (a). This time, T has the additional property that there is an edge in D from a vertex of T 
(i.e. Uf) to ui. To account for this additional property. We adjust ^ as follows: (i) multiply by the number 
of choices of one of the a— 1 other vertices to be uf, (ii) account for the fact that deg^^ (ui) = degr(ui) -f 1, for 

i = 1,£; (iii) choose vertex-copies for the extra edge from ue to ui; (iv) adjust the term (-^i^) e'°° /{E-a) 
which, by Lemma 1121 bounded the probability that the a — 1 pairs of vertex-copies corresponding to edges 
of T each landed in a hyperedge of the configuration. 
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For (ii) and (iii), we use Y{d{uj) + 1) instead of Y{d{uj)) or Y'{d{uj)) for j = 1,£. Recall that in 
part (a), we used y(deg(M)) instead of Y' {deg{u)); since Y{d{ui) + 1) < Y'(deg(Mi)), the adjustment for ui 
is a decrease. The adjustment for ug is an increase of a multiplicative factor of at most {Y{d{ue) + 1) + 
V5)/{Y'{deg{ue)) + VS)< {Y{d*) + VS)/{Y'{1) + VS) = 0(1). 

For (iv), the hyperedge containing ui, ui is in the 2-core and so is distinct from the other a—1 hyperedges. 
This results in another multiplicative factor of ^^ to account for that edge, when applying Lemma [T^ 

The net result is to multipy E{Xa) by 0{a/n), and so the bound on E(Xa) in ([5]) becomes 0(1) ( 1 ^ ^ 

Summing over all a yields that the expected number of flippable cycles C such that \\Ju<^c R^ {u)C^H b \ > uj{n) 
is o(l) for any Lu{n) -> oo. Proposition 1211 and Lemma [22] complete the proof. D 

8 Proof of Lemma [8] below the r-core threshold 

As in the case for c > c^. ^, we will carry out a large but fixed number, /, of rounds of the parallel r-stripping 
process, ending up with a hypergraph Hj. Because we are below the r-core threshold, this will delete all 
but a very small, albeit linear, number of vertices. Proposition [T3] asserts that the remaining hypergraph 
is uniformly random conditional on its degree sequence. We will determine this degree sequence and apply 
the technique from |19j to show that the maximum component size in the remaining hypergraph has size 
O(logn). Thus, for every v, we must have |i?+(w) n Hj\ = O(logn). Proposition [211 and Lemma [22] then 
implies that |i?+(w)| = O(logn) as required. 

In what follows, we will describe the proof for the case r ~ 2 and fc > 3, as this is all that is required for 
Theorem [21 and we write c^. 2 = c^.. The proof for general r is a straightforward adaption. 

We first focus on determining the degree sequence, po, pi, ... oi Proposition[T3] Let Po(/i) denote a Poisson 
variable with mean /i and recursively define the following quantities: 

HO) = 1 

A,; : 



(fc-1)! 
^,+1 = Pr[Po(A,) > 1] 
z, = Pr[Po(A,_i) > 2] 



Lemma 28. 



pi(i) = Pr[Po(A,) = 1] -Pr 



Po ( £(fi^i_fll 1 > 1 



(6) 



(fc-1) 
p,{i) = Pr[Po(A,)=j] ,forj>2. (7) 

Proof. For j > 2, a vertex v has degree j in Hi iff it has exactly j neighbours that survived the first i rounds. 
As proven, for example, in [18] this occurs with probability that tends to Pr[Po(Ai) = j] as n — > 00. 

A vertex v has degree 1 in Hi iff it has exactly one neighbour that survived the first i rounds and at 
least one neighbour that survived the first i — 1 rounds but not the ith round (otherwise v would have been 
deleted during round i or earlier). Arguing very similarly to the proof of (O in [IB], it follows that this 
occurs with probability that tends to the right hand side of (jG)) as n — > cxj. D 

A simple adaptation of the proof of the main result of |19j provides a hypergraph version. Proposition [13] 
and Lemma [16] allow us to apply that hypergraph version to deduce that if 

{k - 1) ^ j(j - i)p, w < E-?'/'^(^)' i-^-' /'i(^) > E((^ - i)j'(^' - 1) - ^>j(^) ' 

j>l j>l j>3 

then w.h.p. all components of Hj have size 0(log7i). 
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As discussed in [H], if c < c^. then liuii^ao (f'i'i) = 0. (Indeed, it is not hard to see that cl is defined 
to be the largest c for which the hmit of is 0.) Thus, by taking i sufficiently large, we can take 0i to be 
arbitrarily small. It will be useful to develop the following asymptotics as Ai_i = A -t- 0: 

z, = l-e-^(l + A) 

= 1-[1-A + AV2-0(A3)](1 + A) 

= y + 0(A3) . 

(|)^ = 1- e-^ 
= A + 0(A2) . 

- ^ + o(d) 



= e(A2/(^-i)) . 

From the above, for k > 2, it easily follows that for A sufficiently small, Zi_i — Zi > \zi-i — 8(A^'''-'^^^'), 
and so 

PiW = A.e-^' (^l_exp(-fei^)) =A.e-^' xeiA^/C^"!)) . 
At the same time, since Ai — >■ 0, a series expansion easily gives the second equality below 



E((^ - 1)jO' - 1) - ^>^(*) = E^'(J' - 2) Pr[Po(AO = j] = e(p3(*)) = e(A,?)e 
i>3 i>3 



-Ai 



Thus, for any e > 0, if i is sufficiently large, we have pi(i) > (1 + e) X)i>3((^ ~ 1)jO ^ 1) ^ i)Pj(^)j since 
A.A^/C^-i) » Af . 

9 Proof of Theorem [1] 

Given a satisfying assignment, we say that a set S of variables is flippable if changing the assignment of every 
variable in 5 results in another satisfying assignment. A flippable set is minimal if it does not contain a 
flippable proper subset. Note that flippable sets can be characterized in terms of the underlying hypergraph. 

Proposition 29. S is flippable iff every hyperedge contains an even number of members of S . 

Thus, recalling Definition [2l a fiippable cycle is a flippable set. Theorem [1] follows from the following. 

Lemma 30. Let H be a random k-uniform hypergraph Hk{n,p), where p = c/n . For every c > cjl 2 there 
exists a > such that ui.h.p. every minimal flippable set in the 2- core of H either is a flippable cycle or has 
size at least an. 

If we could show (deterministically) that the hypergraph induced by any minimal fiippable set in a 2-core 
that is not a fiippable cycle is sufficiently dense, then Lemma [5(11 would follow by a rather standard argument. 
Unfortunately, there is no useful lower bound on the density, mainly because of the possibility of very long 
2-linked paths in S (defined below). Instead, we follow an approach akin to that of ^U\, forming a graph 
r(S') by contracting those long paths, and making use of the fact that T{S) is dense (Lemma [35)) . While the 
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basic idea is similar to |20] . the computations are significantly more challenging as we have to carry out the 
proof in the configuration model. 

To prove Lcnnira [5Dl wc first require a few definitions. Note that these concern any hypcrgraph, not just 
a random one. 

Definition 31. Let % be a k-uniform hypergraph. A 2-linked path of a set S C V^('H) is a set of vertices 
VoT--,Vt G S and hyperedges ei,...,et such that 

(i) VQ,...,Vt are all distinct except that possibly vq = Vt. 

(a) Each Ci contains Vi-i,Vi and no other vertices of S . 

(Hi) vi, ...,Vt-i all have degree 2 inH. 

(iv) Each ofvo,Vt either has degree ^ 2 inH, or lies in a hyperedge not containing exactly 2 members of 
S. 

We call vo,Vt the cndpoints of the path and vi, ...,Vt-i its connecting vertices. 

Note that if vq ~ wt then (iv) implies that deg^(wo) > 2 and hence vq, ■■■, vt do not form a flippable cycle. 
Note also that a loop, i.e., a hyperedge containing a vertex v twice, can yield a 2-linked path with t = 1 and 
Vq = vi = V ii V G S , no other vertices of the edge are in S, and deg^(z;) > 2. If deg^(w) = 2 then ii is a 
flippable cycle. 

Definition 32. We say that S C VlH) is a linked set if (i) S does not contain a flippable cycle as a subset, 
(ii) no hyperedge of % contains exactly one element of S and (Hi) every hyperedge e of % with |en 5*1 =2 
is in a 2-linked path of S . 

Proposition 33. Suppose S is a flippable set which does not contain a flippable cycle as a subset. Then S 
is a linked set. 

Proof. By Proposition [521 '^e only need to check condition (iii). Consider any hyperedge e with |e n S"! = 2. 
Either e itself forms a 2-linked path in 5, or it is easily seen that e can be extended into such a path, unless 
e lies in a flippable cycle. D 

Remark: It is easy to see that in any Uniquely Extendible CSP, the set of disagreeing variables of any 
two solutions must be a flippable set. Since Proposition [33] was derived by only considering the underlying 
hypergraph (and not the speciflc constraints), it applies to any UE CSP. Therefore, our Theorem [1] extends 
readily to all UE CSP since its proof amounts to proving that for some constant a > 0, all linked sets are 
either flippable cycles or contain at least an variables. 

Given a linked set, 5*, we consider the mixed hypergraph T{S) formed as follows: 

(a) The vertices of T{S) are the endpoints of the 2-linked paths in S along with all vertices of S that do 
not lie in any 2-linked paths. 

(b) There is an edge in r(S') between the endpoints of each 2-linked path in S. 

(c) For every hyperedge e oiV. with |e n 5*1 > 2, e n 5 is a hyperedge of T{S). 

Thus y(r(5')) C 5", and since no hyperedge of C contains exactly one element of S, for every v e y(r(S')) 
we have degp(5)(w) = deg^(w). Any vertex of S that is not in T{S) is a connecting vertex of a 2-linked path 
in S. 

Proposition 34. If S is a non-empty linked set, then T{S) has at least one vertex. 

Proof. Any vertex of S that is not in T(S) is a connecting vertex of a 2-linked path in S. The endpoints of 
that 2-linked path are in T{S). D 
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Note that T{S) contains hyperedges of size between 2 and k. For each 2 < z < fc, we define £i to be the 
number of z-edges in r(5'). 

Lemma 35. // every vertex in Ti has degree at least 2 then X]i'=2(* ~ ^)^i — (1 + 2fc)l^(^('^)l' 

Proof. As we said above, every v S F(r(5')) has the same degree in r(S') as it does in H. Thus T{S) has 
minimum degree at least 2. Consider any v of degree 2 in r(S'). Then v has degree 2 in ?^ and hence cannot 
be the endpoint of a 2-hnked path in S, unless v lies in at least one hyperedge of H containing more than 
2 members of S. It follows that v lies in at least one hyperedge of r(iS') of size greater than 2. Therefore, 
at most J2i=3 *^i < ^ X]z=3 ^i vertices of r(5') have degree 2, and so letting Z denote the number of vertices 
with degree at least 3 in r(5'), we have 



\v{r{s))\ <z + fc^£, <fc[z + ^£. 



i=3 \ 2—3 

By the handshaking lemma, J2i=2 *^« ~ Su dcgr(5)(w). Therefore, 



i=2 V i=2 

k 



V i=2 

> ^Edegr(5)(") + ^E^' 

V 2 — 3 

V V i—3 

> |T/(r(5))| + iz+i^f, , since degr(s)(„) > 2 for ah t; 



i=3 

> (i + ^)inr(5))l- 

n 

Let C be the 2-corc of iJ = Hk{n,p). We will apply Lemma [35] with H = C to prove: 

Lemma 36. There exists a > such that w.h.p. C has no linked set of size less than an. 

Lemma [30l follows immediately from Lemma [36l The proof of Lemma [36] will be reminiscent of the proof 
of Lemma [21] but significantly more complicated because (i) we arc working in the configuration model and 
(ii) where we had £2 2-edges in Lemma I36[ we have £2 2-linkcd paths here. First, we provide a technical 
lemma. 

Lemma 37. For any integers a, t, given a set of a vertices in H ^ Hk{n,p), with p = c/n^^^ the probability 
that their total degree exceeds tkca is at most (e/t) 

Proof. Given a set A of a vertices, let Ea denote the number of hyperedges containing at least one member 
of A. The total degree in A is at most kEA- The number of potential edges in Ea is at most a(j,"]^) < an'^"^ , 
and so Ea is dominated from above by Bin(an'^~^,c/n'^~^) and using (") < {ne/zY we get 

Pr [Bin(an^-i,c/n'^-'i) > act] < ('""'"''j (^)°'* < (e/t)"^*. 

D 
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Proof of Lemma [BR By Corollary [TTJ we can work in the configuration model. Let D be the degree 
sequence of C. Recalling Definition [171 Proposition [TSl and our key Lemma [20l we have w.h.p. 

(i) V has total degree jn + o{n), where 7 = fj.'^rif^), 

(ii) V has A2n + o{n) vertices of degree 2, where A2 = e^^/i^/2, 

(iii) there exists C > such that 2(fc - 1)A2 < (1 - C)7- 

For each a > 1, let Xa denote the number of linked sets S* in C for which |r(5)| = a and let X = X]a=i -^a- 
Define 

Ca = I ii2, ..., 4) : (1 + ^) « < E(* - 1)^' - (^ + ^) " + (^ " ^4 ■ 

By Lemma [551 for any linked set S* in C of size a, there is some {£2-, ■•■,^fc) G 'Ca so that r(S') contains at 
least £i z-edges for each i. 

To bound E{Xa), we begin by choosing a vertices, A C V{C) and sum over alH > of the probability 
that their total degree in C lies in the range {tkca, {t + l)kca]. For each t, we upper bound this last probability 
by the probability that their total degree in H lies in {tkca, 00]. Moreover, to sum over all subsets A C V{C) 
we overcount by summing instead over all such A C V{H), and using Lemma[371 Of course, if such a set is 
not a subset of C then the probability of it contributing to Xa is zero. So this provides an upperbound on 

Given A, we sum over all possibilities for the values of (^2, •■•,^A;) & C.a- For each r > 2, we choose ir 
r-sets of vertex-copies belonging to vertices of A. If the total degree of A is in (tkca, it + \)kca\ then the 
number of choices for these ir r — sets is at most 

((t + l)fccarV''/^^, <i(^ + l)fccar'■ 



Denote the £2 2-sets as {ui,wi}, ...,{ui^,Wi2}. For each i = l,...,^2j we select ji > 0, the number of 
connecting variables in the 2-linked path from Ui to Wi , we choose the ji degree two connecting variables for 
that path, and we choose one of the two possible orientations of the vertex-copies of each of those connecting 
variables. Let J = ji + ... + ji^, be the number of connecting variables selected. Let L = X2n + o{n) be 
the number of degree 2 vertices in C. Then the total number of choices for the connecting vertices and the 
orientations of their copies is at most ni=i 2(i — i + 1). 

Next, we apply Lemma [12] to bound the probability that the £3 + ... + £k sets of size at least 3 all land 
in hypcredges of the configuration and that for each i = 1, ...,£2, the first pair in the 2-linked path, i.e., Ui 
and the first copy of the first of the ji connecting variables, lands in a hyperedge of the configuration. Note 
that £2 + ... + £k l£ X]i=2(* ^ 1)^« < 2a + o{n). By assuming a < an for some sufficiently small a, we get 
771 — 2a + o{n) > ^771. Therefore, Lemma [T2| vields that this probability is at most 

n{£2 + ...4)' A A / (fc - i)(fc - 2)...(fc -t + i) 

^""P^ i7n Jl\\ {jn + o{n)y-^ 

/ 8ka^ + o{n) \ -A- Hk - l){k - 2)...{k - i + 1) 
\ 771 / ~_ \ {jn + o{n)Y^^ 

8ka^\'/k^^'~'^'' 



< exp p^ n - 



771 / -^j^ X'^n 

Following the analysis of Lemma [121 '^e have now exposed £2 + ... + £k hyperedges of the configuration. 
Let A be the number of unmatched vertex-copies remaining. Since £2 + ■■■ + £k < 2a + o{n), we have 
A > 771 — 2ka + o{n). If the other vertex-copies required for the 2-linked paths are still unmatched, then we 
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continue; else we halt observing that in this case, the set of choices made so far cannot lead to a linked set 
on the chosen vertices. 

There are J pairs of vertex copies that each need to be in a hypercdge of the configuration in order 
to complete the 2-linked paths. Following the same argument as in Lemma [531 the probability of this 
happcnning is at most W^^i ._T7-^-,x . Applying (iii) above, wc obtain the second inequality below 

2(fc- l)L 2{k - l)\2n + o{n) C 

A 7rt - 2ka + o(n) 2 ' 

if a < an for a sufficiently small in terms of 7, A2. Since 2(fc — 1)L < A and k < 2(fc — 1), we have 
' A-fc(t^ir < 1 — I for each i, leading to 

k 



E{Xa) < (")E (-)*'" E E e«^''^/(^")(n^^^±^l^^^ 



r . ^_. . ^_ ■»■-": lA 



t>0 t2,....tk^Caji,---,ji^>Q \i=2 

fc 
X 

i=2 

^ (l.^„\h /Z,2„„\ ('-1) 



n(|,)'""1(n -^'^^rj;r" ' 



" t>o ^ f2....AeCa \^=2 **■ V ^" / / 

jl,---,jt2>0 

Since J = i, + ... + J,,, we have E„ ,„>o(l - </2)' = (E,>(,(1 - C/2)')'' = (2/C)''. we get 

f2,...AeCa V ^" / V«=2 **■ / '' t>0 ^ 

By our choice of Ca 

fc 



4=2 

fc fe 



^i^,; < 2^(i-l)£,; <3a + 2fc, 

i=2 i=2 

we obtain (2/C)^" < ^'f and 



t>Q t>0 t>Q 



for constants Zi — Zi{c), Zi — ^2(2), since {ejty-'^it + l)3+2fc jg decreasing for large t. Also using a < n we 
obtain 



i+Tr)a+fe-i fe 



e'^\''„8fca/7/v V Na /^fc ca\V 2fc; ,^ -f-f {kca 



E(.Y.) < (^) .-/,(2,z,,,. :L^r Y. n 

oa,|.z..,e.v.(f)'^*)°(£)-(|:<^ 



fc-i 
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Applying (e,>o ^) = e'=^"('=-i) we get 



for a constant F = Y('j,X2,C,b,£,) that does not depend on a, so long as a < an for sufBciently small 
a > 0. This yields E{J2'^i^a) = o(l). Moreover, for all a sufficiently smah, E{Xa) < 2'" . Therefore, 

IE(Ea>ys^a) = o(l) and, thus, E(X) = o(l). D 
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